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Abstract 

Integrase proteins acting on circular double-stranded DNA often change its topol- 
ogy by transforming unknotted circles into torus knots and links. Two systems of 
tangle equations — corresponding to the two initial DNA sequences — arise when 
modelling this transformation: direct and inverted. 

With no a priori assumptions on the constituent tangles, we utilize Dehn surgery 
arguments to completely classify the tangle solutions for each of the two systems. 
A key step is to combine work of our previous paper [7] with recent results of 
Kronheimer, Mrowka, Ozsvath and Szabo [31] and Ernst [18] to show a certain 
prime tangle must in fact be a generalized Montesinos tangle. 

These tangle solutions are divided into three classes, common to both systems, 
plus a fourth class for the inverted system that contains the sole generalized Mon- 
tesinos tangle. We discuss the possible biological implications of our classification, 
and of this novel solution. 

1. Introduction 

DNA is often circular, and so can be knotted or linked [34]. To aid in essential 
cellular tasks, many proteins have thus evolved to manipulate the geometry and 
topology of DNA. Important examples of these tasks include: replication (creat- 
ing 2 copies from one double strand of DNA), inversion (inverting a subsequence 
of DNA), and integration and deletion (inserting and deleting DNA sequences 
into/from other DNA segments). For instance, certain proteins invert crossings 
of two double strands of DNA to unknot or unlink DNA [3] . 

It is important for biologists to understand how these proteins operate. So 
a model is often generated based on data obtained by experimentally probing a 
particular system. For example, if a particular protein can knot or link circular 
DNA, then by running DNA through a gel, one can determine the crossing number 
of the DNA knots or links. One can then use electron microscopy to identify the 
exact knot or link type [32]. 

By modelling different regions of the (possibly knotted) DNA molecule as tangles, 
we can describe the protein's action as a change in one of the constituent tangles. 
Mathematicians can help biologists by finding all tangle combinations that may 
explain the protein's action. Some of these mathematically possible solutions can 
then be eliminated through biological considerations. 

Based on the biological work of Wasserman and Cozzarelli [50,51], Ernst and 
Sumners developed the tangle model to describe and make predictions — later exper- 
imentally verified — about how the protein Tn3 interacts with DNA [20] . The tangle 
model has since been used to determine specific features of a wide variety of protcin- 
DNA interactions (see for example, [7, 11, 13, 15, 17, 19, 21, 24, 42, 44, 48, 52]). 
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1.1. The integrase family of recombinases. We will focus on one particular 
family of proteins that affects DNA topology: the integrase family of site-specific 
recombinases, also known as the tyrosine recombinases. For an overview of this 
family, see the review of Grainge and Jayaram [25]. The integrase family includes 
Flp, A Int, and Cre. Integrases are involved in a wide variety of biological activities, 
including integrating (i.e., inserting) a virus' DNA into a host cell's DNA. Integrase 
proteins have a common biochemical mechanism and share a subsequence of amino 
acids. Given varying numbers of DNA axis self-crossings (supercoils) these proteins 
can transform unknotted circular DNA into a variety of torus knots or links [25]. 

Members of the other family of site-specific recombinases, the serine recombi- 
nases, act very differently. These proteins, such as Tn3, require a fixed number of 
supercoils. The protein then cuts and rearranges the DNA (occasionally multiple 
times) before releasing it. For several serine recombinases, it has been possible to 
completely solve the tangle model equations (see [20,21,48]). 

In contrast, for integrase proteins, the unbound DNA's configuration varies from 
molecule to molecule, depending on the degree of supercoiling. Further, these 
proteins act only once before releasing the DNA. So, to model the varying con- 
figurations, and its effects on the resulting DNA products after a single round of 
recombination, we must use a larger number of tangles than needed for a serine 
recombinase. Rather than four fixed tangles, a solution to the (direct or inverted) 
system of equations consists of three fixed tangles, P, R and O c , and an indexed 
family of tangles, Oj. In general, there are an infinite number of solutions to such a 
system. The increased complexity of the tangle model for the integrase family has 
thus far prevented a full solution to the tangle model. Previous work found solutions 
by making several assumptions which are thought to be biologically reasonable and 
which lead to considerable mathematical simplification [11,24,44]. 

We previously considered the tangle model for a generic member of the integrase 
family [7] without any a priori assumptions on the constituent tangles. We gave and 
exemplified three (topologically motivated) classes of solutions for Of, O c and P, all 
of which are valid for both systems. We then developed Dehn Surgery arguments 
to eliminate all (in the direct system) or all but one (in the inverted system) other 
possibilities. 

In this paper, we now complete the classification of all possible solutions to the 
systems of equations arising from both the direct and inverted cases. In particu- 
lar, we utilize recent work of Kronheimcr, Mrowka, Ozsvath and Szabo [31], and 
Ernst [18, 19] to determine the fourth solution in the inverted case. In addition, 
we determine all possible solutions for R in both the direct and inverted settings. 
Combining the results derived below with our previous work [7], we obtain a com- 
plete classification of all solutions to the equations arising from either of the direct 
or inverted system: 

Theorem 1. There are three classes of solutions to the equations that model 
integrase-mediated DNA recombination 

N(O) + O c + P) = 6(1, 1) = unknot for k E {0, 1, 2, 3} 
N(Of + O c + R) = b(2k, 1) = T 2 k,2 torus link for direct sites 

or 

N(O k f + O c + R) = b(2k + 1, 1) =T 2 fc+i,2 torus knot for inverted sites 

1. P = (oc), O c , and R are integral tangles. 

2. P and O c are integral, Oj = (oo) for at most one value of k, integral for at 
most two values of k and otherwise is vertical or the sum of a vertical plus 
an integral tangle. R = (oo), integral or vertical plus integral. However if 
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k f = (oo) for some k, then R cannot be (oo), nor can it be integral unless 
k = 0. 

3. P and R are strictly rational, O c is integral, and O h j is integral for at most 
value of k and otherwise strictly rational. 

For the direct system, there are no other classes of solutions. 

For the inverted system, there is precisely one additional class of solutions: 

4. P = ±(p), R = ±(l+p), O c is integral, is the prime Montesinos tangle 
T(|, \,P~ 1) and O k f ^ 2 = ±{-p + ^) or t{p + 2FF2)' where P G {0, !}• 

Theorem 1 is a consequence of Theorem 1 in [7] and Theorem 4.8, as well as 
Theorems 5.2, 5.3 and 5.4. Note that O c is always integral. Also note that the only 
thing that keeps the solutions in Class 4 from being solutions in Class 2 is tangle 
O). 

Note that solutions are unique up to compensating negative and positive vertical 
sums of vertical tangles or horizontal sums of horizontal tangles. See Section 2 for 
a discussion of these equivalence classes. 

The paper is organized as follows: In Section 2 we review some basic facts about 
tangles, four-plats and their corresponding double branch covers. In Section 3 
we provide the biological motivation and background for our work: the action of 
an integrase on DNA with either inverted or direct repeats. We also present the 
generalization of Ernst and Sumners' mathematical model in terms of tangles and 
four-plats [20]. In Section 4 we focus on the inverted system. Here we harness 
recent results of Kronheimer, Mrowka, Ozsvath and Szabo [31] and Ernst [18, 19] 
to limit the possibilites for Oy, and then determine the complete fourth solution. 

In Section 5 we present examples of all classes of solutions to these tangle equa- 
tions, including the novel inverted solution. We conclude with some remarks on 
the biological relevance of our model and solutions, as well as possible directions 
for future (biological and mathematical) work. 

2. Tangles, Four-Plats and their Double Branch Covers 

We begin by recalling a few elementary facts about tangles. (For a more com- 
prehensive introduction to tangles, see [7] and [22]). A tangle T is a pair (B 3 ,t), 
where B 3 is a 3-ball with a given boundary parametrization with four distinguished 
boundary points labelled NW, NE, SW, SE, and t consists of a pair of properly em- 
bedded unoriented arcs with endpoints NW, NE, SW and SE. We say two tangles 
A and B are equivalent if there exists an isotopy taking A to B, which remains the 
identity on dA. 

Tangles can be divided into three mutually exclusive families: locally knotted, 
rational and prime. 

A tangle is locally knotted if there exists a sphere in B 3 meeting t transversely in 2 
points such that the 2-ball bounded by the sphere intersects (ina knotted spanning 
arc. Locally knotted tangles have reducible double branch covers (branching over 
t). 

We note that locally knotted tangles do not occur in our context since the DNA 
molecule starts off unknotted, P, O c and cannot be locally knotted. Suppose 
R were locally knotted, with S C R be a sphere bounding a ball B containing the 
locally knotted arc. Then, since every product is a prime knot, N(Oj + O c + R) — B 
must be an unknotted arc, for all k. But this implies that N(O k + O c + R) = 
N(0 3 j: + O c + R) for j : ^ k, a contradiction. So none of the constituent tangles are 
locally knotted. 

Rational tangles are the second family; they are so called because their equiva- 
lence classes are in one-to-one correspondence with the extended rational numbers 
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(QU {oo}) via a continued fraction expansion, as first constructed by Conway [10]. 
(See [22,28] for nice classifications.) A tangle whose corresponding rational num- 
ber is | will be denoted by (^j- Rational tangles are formed by an alternating 

series of horizontal and vertical half-twists of two (initially untwisted) parallel arcs 
(and hence are freely isotopic to them). Any continued fraction decomposition of 
| = a n + l/(a n -i + ... (l/oi)) yields a finite list of integers [a\, . . . , a n ] which tell 
us how to twist the strands around each other to get a diagram of the tangle. The 
(0) tangle corresponds to two untwisted horizontal arcs (one joining NE to NW 
and one joining SE to SW), whereas the (oo) tangle corresponds to two untwisted 
vertical arcs. The double branch cover of a rational tangle is a solid torus. 

All locally unknotted, non-rational tangles are prime. The double branch cover 
branched over t of a prime tangle is irreducible and has incompressible bound- 
ary [33]. Blciler demonstrated that the minimal prime tangle has a minimal pro- 
jection with five crossings [4]. 

There are several operations one can perform on tangles. We concentrate on 
three. The first operation forms a knot or 2-componcnt link from a given tangle 
A: the numerator closure, N(A). This adds an unknotted arc joining the northern 
cndpoints, and another unknotted arc joining the southern endpoints, or equiva- 
lently, the boundary of A and (0) are identified so that E^ is identified with E( ) 
for E e {NE, NW, SE, SW}. 

The second operation, tangle sum, takes a pair of tangles A, B, and, under certain 
restrictions, yields a third tangle, A + B, by identifying the eastern hemispheric 
boundary disk of A with the western one of B in such a way that NE^ is identified 
with NWb and SE^ is identified with SWb. Note that the (0) tangle is the identity 
under this operation: A + (0) = A. Beware that under tangle addition, we cannot 
distinguish between A + (p) added to B + (—p) and A added to B. Thus although 
tangle summands are written in their simplest form, they are unique only up to an 
arbitrary number of compensating positive and negative horizontal twists. 

The last operation is the vertical sum *, which takes two tangles A and B and, un- 
der certain restrictions, yields a third tangle, A*B, by identifying the southern hemi- 
spheric boundary disk of A with the northern hemispheric boundary disk of B in 
such a way that SE^ is identified with NE# and SW^ is identified with NWb- Note 
that under the numerator closure operation, given two tangles A and B, although 
(A*(l/n)) + B A+(B*(l/n)) in fact AT ((A * (1/n)) + B) ~ N (A + (B * (1/n))). 
Also, N ((A * (1/n)) + (B* (—1/n))) ~ N(A + B). As with horizontal tangle 
sums, we cannot distinguish between these two cases. Thus although tangle so- 
lutions are written in the simplest form for a given equivalence class, they are 
unique only up to an arbitrary number of compensating positive and negative ver- 
tical twists. 

A particular class of prime tangles is obtained by tangle sum of rational tangles 
and will be of interest to us: Montesinos tangles [36]. We will use the notation 

(fr> ff' • • • IT") to denote the Montesinos tangle obtained by the tangle sum of the 

rational (possibly integral) tangles (§^) , (l>7 ) ' ■ • ■ (r 2 ") • ^ tangle that is ambient 
isotopic to a sum of rational tangles called a generalized Montesinos tangle. Since 
the sum of a rational tangle and an integral tangle yields a rational tangle [39], a 
Montesinos (non-rational) tangle must have at least two non-integral summands. 

The numerator closure of a rational tangle yields a four-plat, a knot or 2- 
componcnt link that admits a projection consisting of a braid on 4 strings, with 



one strand free of crossings [1]. Given two rational tangles y^J and y^rj then 
N (^j = N ^21^ iff p = p' and q ±1 = q mod p. Thus given a 4-plat, we can write 
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it as the numerator closure of a rational tangle that is unique up to the relationship 
above. (See [29] for a classification of rational knots.) Schubert showed that all 
four-plats are prime knots [43]. A four-plat obtained by the numerator closure of 



y£j is written as b(p, q). For example, we can write the unknot as 6(1, 1), and the 
trefoil as 6(3, 1). 

2.1. Double Branch Covers. If T is a tangle, then T will mean the double cover 
of B 3 , branched over t. In general, we will write dbc(.fS') to denote the three- 
manifold that is the double cover of S 3 branched over the set K. We now turn our 
attention to the (compact, connected and orientable) three- manifolds that arise as 
double branch covers of tangles or four-plats. 

If P is a rational tangle, then P is a solid torus, which we will denote by Vp. 
Schubert showed that dbc(6(/>, q)) is the lens space L(p,q). Two four-plats b(p,q) 
and b(p', q') are equivalent if and only if their corresponding double branch covers, 
the lens spaces L{p, q) and L(p' ' , q'), are homeomorphic [43], so b{p 1 q) — b(p' ', q') if 
and only if p = p' and q ±1 = q' mod p. (See Rolfsen [40], for a good introduction 
to lens spaces.) 

A summing disk of a tangle (either the western or eastern summing one) lifts 
to an annulus on the boundary of the double branch cover. This annulus can be 
meridional (its core bounds a meridional disk of the solid torus), longitudinal (its 
core intersects a meridian once) or neither (intersects a meridian more than once). 
When two tangles are summed, this corresponds to gluing two solid tori along the 
annuli that are lifts of the summing disks. Hence, a natural subdivision of rational 
tangles arises: We say a rational tangle is integral, and write it as (n), if it consists 
of a series of n horizontal half-twists, where n e Z. We denote this class as Z. 
Integral tangles have summing disks that lift to longitudinal annuli. Similarly, a 
tangle is the infinity tangle, denoted by (oo), if it consists of two vertical parallel 
strands with no twists. The infinity tangle has a summing disk that lifts to a 
meridional annulus. We say a tangle is strictly rational, and denote this class as Q! 
if it is neither integral nor the infinity tangle, and so Q = Q! U Z U {(oo)}. Strictly 
rational tangles have summing disks that lift to annuli that are neither meridional 
nor longitudinal. Within the class of strictly rational tangles we distinguish vertical 
tangles, written as (i) , which consists of a series of n vertical half-twists (|n| > 1). 



We can now describe in full detail a generalization [44] of the original tangle 
model of Ernst and Sumncrs [20] . 

We illustrate our model with a member the integrase family of rccombinases, 
the protein Flp (pronounced 'flip')- Flp has served as the paradigm for site-specific 
recombination [53], and there are a number of proteins (including Cre, and A Int 
acting on LR sites) whose products are, topologically speaking, identical to those 



Roughly speaking, Flp recognizes two copies of a specific DNA sequence, binds 
at these sites, cuts the DNA at the sites, moves the strands in space, reseals the 
break, and releases the DNA. When acting on circular DNA, Flp can change the 
underlying knot type of the DNA, for example turning the unknot into the trefoil 
knot. The distribution of knot /link products should reflect the supercoiling density 
[38] . We call a DNA molecule that has not been acted on by Flp a substrate, and a 
molecule that has been acted on a product. In these terms, the substrate is always 
an unknot and the products are torus knots or links. 

We model each of the substrates and products as the numerator closure of the 
sum of three tangles. Each tangle arc represents a segment of double-stranded DNA. 




3. Biological Motivation and Model 



of Flp. 
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In the tangle model pioneered by Ernst and Sumners [20], the cutting and joining 
of DNA is assumed to be completely localized: two of the tangles are unchanged by 
the action of the protein. In the substrate, the first tangle, P (Parental), represents 
two short identical sites that Flp recognizes and to which it chemically binds and 
then cuts, rearranges and re-seals. This action can be thought of as removing P 
and replacing it with a new tangle, R (Recombinant), in the product. The second 
tangle, O c , represents the part of the DNA that is physically constrained, but 
unchanged, by the protein (O stands for Outside and c for constrained). The last 
tangle, O k , represents the part of the DNA that is free (hence the subscript /) 
from protein binding constraints. O k can vary depending on the amount of DNA 
supercoiling present at the time Flp acts. The superscript k indexes these different 
possibilities. 

In terms of tangles, this amounts to saying that our substrate and products can 
be modelled as: 

N(Oj + O c + P) = substrate (before recombination) 

N(O k + O c + R) = product (after recombination) 

where k e {0,1,2,3}. O k varies as k varies, so we obtain different products, as 
described below. We use O k to mean the part unchanged by Flp, that is, O k = 
0) + O c . When there is no mathematical distinction between Of and O c , we use 
0\ and O2 to represent them interchangeably, i.e., {0\,02\ = {Of,O c }- 

Recall that, in contrast to integrases, proteins in the serine family of recombi- 
nases, such as Tn3, require a fixed number of supercoils before they begin cutting 
and rejoining DNA. Once this requirement is met, they rearrange the DNA, oc- 
casionally multiple times, before releasing it. The corresponding tangle equations: 
from substrate N(Of + O c + P) = K to products N(Of + O c + nR) = K n , were 
first solved (i.e., all constituent tangles have been characterized, given the 4-plats) 
by Ernst and Sumners [20]. Note that the free part, Of, does not vary. This sin- 
gle, fixed Of is what has thus far made the serine recombinase tangle equations 
more tractable than the integrases, whose equations involve a family of tangles Of, 
indexed by k. 

3.1. Two Systems: Inverted and Direct. Flp identifies two short identical 
sequences, called repeats, on a molecule of DNA. These sites are non-palindromic 
sequences, and can thus be given an orientation, and hence on circular DNA, the 
strings can be in head to head (inverted repeats) or head to tail (direct repeats) 
orientation. Action on inverted repeats on a circular molecule of DNA yields a 
knot, and action on direct repeats yields a two-component link. When Flp acts on 
DNA it yields a variety of torus knots (inverted repeats) and links (direct repeats) 
that depend on Of. 

When Flp acts on a DNA molecule with inverted sites, experiments have shown 
that the resulting DNA can be an unknot (with a different DNA sequence), or 
a knot with up to 11 crossings [25]. Crisona et al. have obtained images (using 
electron microscopy) of the simplest products, and has shown that they are the 
torus knots 6(±1, 1) (the unknot), primarily positive 6(3, 1) and exclusively positive 
6(5, 1) [11]. This experimental evidence indicates that Flp begins with an unknotted 
DNA substrate with inverted repeats, 6(±1, 1) and converts it via tangle surgery 
into a torus knot b(±(2k + 1),1), where k G {0,1,2,3}. (The chirality of the 
products for k = 1 and k = 2 has not been determined for all members of the 
integrase family, so we remain in the general situation. Corollary 4.9 considers the 
specific setting for Flp (and A Int acting on LR sites), whose products' handedness 
are known.) 



CLASSIFICATION OF TANGLE SOLUTIONS FOR INTEGRASES 



7 



We thus model the action of Flp on DNA with inverted repeats as: 

Before: N{O k f + O c + P) = b(±l, 1) = unknot, for fee {0,1,2,3} 

After: N(O f + O c + R) = 6(±1, 1) = unknot 

N(O k f + O c + R) = b(±(2k + 1), 1) = torus knot for k G {0, 1, 2, 3} 

When Flp acts on a DNA molecule with direct sites, experiments have shown 
that the resulting DNA can be an unlink, or a 2-component link with up to 10 
crossings [25]. Electrophoretic gels have determined that the simplest products 
are 6(0,1), 6(±2, 1) and 6(±4, 1) [25]. This experimental evidence indicates that 
Flp begins with an unknotted DNA substrate with direct repeats, 6(±1, 1) and 
converts it via tangle surgery into a torus link b(±2k, 1), where k € {0, 1, 2, 3}. We 
thus model the action of Flp on DNA with direct repeats as: 

Before: N{O k f + O c + P) = 6(±1, 1) = unknot, for k G {0, 1, 2, 3} 

After: N(O f + O c + R) = 6(0, 1) = unlink 

N{O k f + O c + R) = 6(±2fc, 1) = torus link for k G {0, 1, 2, 3} 

3.2. Strategy. Given the set of tangle equations above, whose products (4-plats) 
are known, the goal is to determine the constituent tangles. The interplay of tangles 
and four-plats with their corresponding double branch covers is the key to many 
of our results in tangle calculus. For instance, if C and D are tangles, and D is a 
rational tangle, then the dbc(C + D) is obtained by gluing C and D = Vd along 
annuli that are the lifts of their corresponding gluing disks. If D is integral, then 
the gluing annulus is boundary reducible, and dbc(C + D) ~ C. 

The sum and subsequent numerator closure of two tangles C and D induces 
a gluing of the boundaries of their respective double branch covers C and D. If 
N(C + D) yields a four-plat b(p, q), then C Uh D must be the lens space L(p, q), 
where h is the map that takes fj, Q ^, to P^ dD + qp-g D - In particular, when C and D 
are both rational, C = Vc and D = Vd arc solid tori, and they form a Heegaard 
splitting V c U/j Vd of L(p, q). 

Replacing tangle P in N(0 + P) by tangle R to obtain N(0 + R) is called 
tangle surgery. If P and R are rational tangles, then tangle surgery corresponds to 
replacing Vp with Vr in the double branch cover, and thus corresponds to different 
Dehn hllings of O k . In the case of N(O k + P) = 6(1,0), the unknot, then since 
dbc(6(l, 0))is S 3 , and the tangle surgery corresponds to Dehn surgery on a knot 
complement (O k ) in S 3 . If O k is not rational, then the knot is non-trivial. 

Previous work (Ernst and Sumners [20], and the authors [7] via different tech- 
niques for R) proved that P and R are rational for both direct and inverted repeats. 
So the tangle surgery of replacing P with R corresponds to Dehn surgery in the 
double branch covers. 

Thus the strategy is to use restrictions on the type of Dehn surgeries of S 3 = 
dbc(6(l,l)) that yield lens spaces. This in turn restricts the possible tangle solu- 
tions. 

4. The fourth case for inverted repeats 

In [7], we asked whether there were any solutions in the single remaining open 
inverted case: P rational, 0\ integral, O2 prime (and hence O k = O k + O c prime). 
We now can give a positive answer for k = 2 and eliminate this case for all other k. 

In this section, we consider only the inverted system, as we previously completely 
classified the direct system [7] . 
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4.1. Double branch cover of is a trefoil knot complement. We begin by 
restricting the possibilities for k , in part by harnessing a powerful recent result of 
Kronhcimer, Mrowka, Ozsvath and Szabo: 

Theorem 4.1 ([31], Corollary 8.4). If K is a knot in S 3 , such that for some reZ, 
Mk(t) = L(p,q) is a lens space where \p\ < 9, then K must either be the unknot 
or the trefoil knot. 

Theorem 4.2. In the inverted repeats system, O k is rational for k G {0,1,3}. 
Further, if O 2 is not rational, then O 2 must be the complement of a trefoil knot. 

Proof: Recall that since both P and R are rational tangles, then the tangle equa- 
tions N(O k + P) = 6(1, 1) and N(O k + R) = b(2k + 1, 1) correspond to a Dchn 

surgery along core(Vp) in the (possibly trivial) knot complement O k := S 3 \ Vp 
that yields L{2k + l,l) 

The possibilities of O k being satellite (for all k) or torus (for k ^ 2) have been 
eliminated in Theorem 7.1 of [7]. Gordon [23] and Moser [37] have classified all 
surgeries on a generic torus knot T a ^ complement that yield lens spaces: L(p, qb 2 ) 
iff p = qab ±1. A straightforward calculation shows that only L(5,4) can be 
obtained from a torus knot — the trefoil knot T^^- 

It now remains only to rule out the possibility of O k being hyperbolic. Assume 
O k is a hyperbolic knot (K) complement. Then by the Cyclic Surgery Theorem [12], 
the surgery slope must be integral, and in fact must be 2k+l, since H\(MK{s/t)) = 
Z| s |. In our setting, the integral surgery slope 2k + 1 is strictly less than 9, since 

k G {0,1,2,3}. Hence, we can apply Corollary 8.4 of [31], and obtain that O k 
must be a solid torus or a trefoil knot complement, neither of which is hyperbolic. 
Therefore the hyperbolic case is impossible. 

We have thus shown that O k ^ 2 must be the complement of the unknot (a solid 
torus), and therefore O k is a rational tangle. Further, O 2 is either a solid torus or 
trefoil knot complement, and therefore O 2 is either a rational tangle or a tangle 
whose double branch cover is a trefoil knot complement. □ 

4.2. O c is integral and is prime. We can now use the rationality of P and 
O k := O c + O k for k ^ 2 to place restrictions on the summand tangles. 

Theorem 4.3. In the inverted system, assume O 2 — 0\ + O2 has a trefoil knot 
complement double branch cover. Then O c must be integral and O 2 must be prime 

with O 2 is a trefoil knot complement. 

Proof: We first show that our only option in this setting is 0\ is integral and O2 
is prime. 

From Section 6 (as summarized in Table 1) of our previous paper [7], the only 
cases not eliminated are (i) P rational, 0\ prime and O2 integral, and (ii) P 
integral, 0\ — (00) and 2 prime. We now show possibility (ii) cannot occur. 

Theorem 6.8(w) of [7] eliminates the case when O c — 0\ — (00) and O k = O2 is 
prime. Alternately, the case with O c prime and O k - = (00) can occur for at most 
1 value of k, since for k ^ j, N(O k f + O c + R) ^ N(O j f + O c + R). By Theorem 
6.8(viii) in [7], O k must be integral for all other values of k. But then for these 3 
(or 4) values of k, O k :— O k + O c = prime + integral, is prime [39], a contradiction 
to Theorem 4.2. 

Thus the only possibility that can occur is Ox is integral and O2 is prime. Sup- 
pose that O c = O2 is prime and 2 f = 0\ is integral. Then since O c + OH is rational 
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for k ± 2, then O) = (oo) for k = 0, 1 and 3 by Cam Van [39]. But for k ^ j, 
O k ^ J as these are different recombination products. Thus O c must be integral, 
and O 2 prime. 

Since O c is integral, dbc(O c + 2 ) ~ O 2 ~ O 2 , the trefoil knot complement from 
the preceding theorem. □ 

4.3. Of is a Montesinos tangle, and P and R are integral. Montesinos 
links were first considered by Bonahon [6] and Montesinos [36], by using work of 
Tollefson [47] who determined that every involution of a Seifert fiber space with non- 
empty boundary must respect a Seifert fibration. Therefore if the double branch 
cover of a link in S 3 admits a Seifert fibration which is invariant under the covering 
involution the link is either a torus link or what is now called a Montesinos link. 
Ernst [18], with a later clarification by Darcy [13], used related techniques to de- 
termine that any tangle whose double branch covers is Seifert fiber space must be 
a rational or a generalized Montesinos tangle: 

Theorem 4.4 (Ernst [18], and Darcy [13]). If M is a SFS with orbit surface a disk 
and n > exceptional fibers and if M is the 2-fold branch cover of a tangle (B,t), 
then B is a generalized Montesinos tangle. 

Corollary 4.5. In the inverted system, the case where P is a rational tangle, O c is 
an integral tangle, and O 2 is a prime tangle, then O 2 is a generalized Montesinos 
tangle with two non-integral rational tangles. 

Proof. From Theorem 4.3 we have that O 2 is a trefoil knot complement. By 
Theorem 4.4, O 2 then must be a generalized Montesinos tangle with two non- 
integral summands. □ 

The following result was first used without proof by Darcy in [13]. We give a 
short self-contained proof below, since we will also need this for the fourth solution. 

Corollary 4.6. If N(0 2 f + 0) is a four-plat, then Oj is of the form (f , f ) * (^) 
with m e Z. 

Proof. From Corollary 4.5, we know that O 2 is a generalized Montesinos tangle 

with two non-integral rational tangles. Note that (§, §) + (m) = (f , c+ ^ m ), a 
Montesinos tangle with two non-integral summands. 

As a consequence of the above and [14], such a generalized Montesinos tangle is 
of the form 



((&' d) * (m„)) 

^)*(i)) +(m "- l} 



+ K-i) )*... + {mi) 



if n is even, or 



if n is odd. 



1 

mi 



We will show that if n > 1 then N(Oj + 0) ~ N(Oj) cannot be a four-plat. Note 
first that if n is odd, then JV (( (f , |) * (^)) + Ki)) *"- + (™2 ))*(;£-)) = 

^ ((((f ' I) * (m - )) + ( m "-i)) * • • • + ( m 2)j ■ Hence we need only examine the 
case when n > 1 is even. 

Recall that N ({A * (1/n)) + B) ~ TV (A + (B * (1/n))). Hence by a simple in- 
ductive argument we can show that N ^(((f > f) * + * • • • + ( m i) 



N 



((?. i) + ((Wo • • • + (((-3) + ((-1) *(£)))*(£)■■■)*(£)))■ The 
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tangle (((m„_i) . . . + (((m 3 ) + ((mi) * (^))) * (^7) • • •) * (^)) is a ratio- 
nal tangle, isotopic to the tangle given by the vector [mi, m 2 , . . . , m„, 0]. If n > 1, 
this is not an integral tangle, and so TV (( (|, ^) * (^~)) + ( m «-i)) * • • • + ( m i)j 
is isotopic to the numerator closure of the Montesinos tangle with three non-integral 
rational tangles whose double-branch cover is a Seifert fiber space with three excep- 
tional fibers, and thus cannot be a four-plat. Thus n = 1, and so O 2 is of the form 
(§'§)*(m)" Note if m = 1, then O 2 is properly (not generalized) Montesinos. □ 

Theorem 4.7 (Ernst [19]). In the inverted system, if O 2 a Montesinos tangle with 
two non-integral rational summands, then P and R are integral tangles. 

Proof. Recall that N(0 2 + P) = 6(1, 1), and N(0 2 + R) = 6(5, 1), whose double 
branch covers are S 3 and 1/(5, 1) respectively. Now O 2 is a Montesinos tangle with 
two non-integral summands. If P were not integral, then the double branch cover 
of N(0 2 + P) would be a Seifert fibered space with three exceptional fibers, a 
contradiction, as it is a lens space. Similarly with R. □ 

4.4. The Final Inverted Solution. From the previous section, this fourth case 
reduces to P, R and O c are integral and O 2 is Montesinos with two non-integral 
tangles. We now determine all solutions to the tangle equations in this case, by 
applying an algorithm of Ernst whose completeness is guaranteed [19, Theorem 
3.1]. 

Theorem 4.8. In the fourth solution of the inverted case, the tangle solutions are 
P = ±( P ), O c integral, R = ±(l+p), O 2 = t(|,|,P-1) and Of 2 = ±(-p + ±) 
or =F(p + 2FP2). where p G {0, 1}. 

Proof: First note that if P = (p), where p ^ 0, we can assume that P = (0) by 
moving the p horizontal twists into O k . So given a solution P — (0), R = (r), 
and O k , then the solution corresponding to P = (p) would be R = (r + p) and 
O k = 6 k + {-p). 

Case 1: k — 2. From Theorems 4.2 and 4.7, O 2 is a Montesinos tangle of two 
non-integral summands, and P and R are integral tangles. In this section we will 
use the same notation as in the algorithm in [19]: O 2 = (u/v,x/y), P — (m ) and 
R = (m). By moving the mo horizontal twists of P into O 2 , we can set P = (0), 
R = (m), and O 2 = (u/v, x/y, mo). We can now apply the algorithm to determine 
m , m, u, v, x and y . 

For k = 2, we begin with 6(ai,/3i) = 6(1,0) and obtain 6(a2,/?2) = 6(+5,l). 
Ernst's algorithm yields mo = m = —1, u = 1, v = x = 2 and y = 3. So the 
solution is cither R = (-1), and O 2 = (^, |,m ) = (|, |, -1) = {\, or their 
mirror images: R = (1), and O 2 = (— |, — |, 1). 

Case 2: k ^ 2. We will show that if P = (0), then O k = T (^) or ± (^+2)- 
We have that N(O k + P) = N(O k + (0)) = N(O k ) = N (±) and N(O k + R) = 

N(O k =F (1)) = N (±^±i). 

By the correspondence between numerator closure of rational tangles and 4-plats, 

N (|) = AT (2^ if and only if p = p' and g ±1 = 9 ' mod p [10], [43]. This tells us 

that O k is a vertical tangle (|) . The tangle (O k + R) = (i ± l) = (^) ) wnC rc 
the sign is the sign of R. Hence 1 ± q = ±2fc + 1 and so q = 2k or — 2k — 2 when 
R = (1), and q = -2k or 2k + 2 when R = (-1). 

We thus obtain the general solution by converting as described above. □ 
Note that as O c is integral, then O c is either zero or the integral part of Oj. 
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We reiterate that, although solutions are given in their simplest form, they are 
unique only up to compensating positive and negative vertical twists. So for exam- 
ple, as noted by Darcy, P = (0), R = (1) and O 2 = (5, §) * (-5-) is also a solution 
for k = 2 [16]. 

For both Flp and A Int (acting on LR sites), Crisona et al. have numerous 
electron microscope images of two of the products [11]. Most of trefoils (k = 1) and 
all of the pentafoils (k = 2) examined are positive. This chirality further constrains 
the tangle possibilities for Flp and A Int (on LR sites). 

Corollary 4.9. For both Flp and A Int (acting on LR sites), if P = (p), then p G 
{0,-1} andR=(p-l), O 2 = (§,§,-p-l), O 1 = andO k = (-p+&) 

or (-p + 2FP2) fork = or 3. 

Proof: Note that the right-handed products, when P = (0), are given when O 2 = 
(^,|,— 1) and O^ 2 = ^— or O k j^ 2 = ( — Jk)- Therefore, the handedness of 

the product b(+5, 1), as determined through electron microscopy by Crisona et al. 
[11] means that p = (—p), R = (-1 - p) and O 2 = (|, §,p - 1) = (5, for 
p E {0, 1}. By including the negative sign in p we get the resulting P, R and O 2 . 

In the case k = 1, further electron micrographs of Crisona et al. show the 
majority of the product is b(+3, 1). Since R = (—p — 1), the corresponding right- 
handed solution is O 1 = (— p — |). 

No chirality information exists for k = or 3. Therefore there are still two 
possibilities for k — or 3. □ 

5. Solutions 

In this section, we present all tangle solutions for an integrase acting on DNA 
with direct or inverted sites. In particular, we determine what R must be in each 
case. 

We should note that when O k = O c + O k is known to be rational then the 
following theorem allows us to find a values of O k and R that will satisfy tangle 
equations arising from an unknotted substrate and four-plat products. 



Theorem 5.1 ([13] Lemma 14, [20]). With c'd - cd' = 1 : 

j) + ~d< 





-be \ 


)= n (T 


1 \ad'-\ 


-be') 



For instance, in our case we have the unknot as substrate, and torus knots and 
links (that are the numerator closures of integral tangles) as products. 

Theorem 5.2. Given O n , P — | and R are rational tangles such that: 

N(O n + P) = N , the unknot 

N (O n + R) = N (T^-r), the torus knot or link T T („_ t ), 2 
Then O n = & R = ^±- t for rq + ps = 1, for any constant t. 

Proof. We begin by noting: 

Claim. Suppose p and q are relatively prime, and rq + ps = 1. Then r'q + ps' = 1 
if and only if r' = r ± pt and s' — s =p qt. 

Proof of Claim. If r' = r ±pt and s' — s =p qt then checking that r'q + ps' = 1 is a 
simple calculation. Conversely, r'q + ps' = rq + ps, so (r' — r)q = p(s — s'). Since 
p and q are relatively prime, then q\(s — s'), so s — s' = qt for some t. Therefore 
r'q = rq + p(s — s') = rq + pqt so r' = r + pt. Therefore rq + pqt + ps' = 1, i.e., 
rq + p(qt + s') = 1, and so s = s' + qt, and hence s' = s — qt. 
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Applying Theorem 5.1 to N (gEE + f ) and N (g^ + ^) now gives the 
result: In the first case, A ( r ± pn + H ] = JV ( -!- ) . In the second case, note that 
-p(-s ± gi) - (-g)(r ± pi) = 1, so N (g^ + = N (±*-) . □ 

Since for inverted repeats N (O k + R) = T (±(2fc + 1), 2), setting n = 2k + 
1 + t gives a solution for the inverted system. For direct repeats N (O k + R) = 
T (±2fc, 2), so setting n = 2k + t gives a solution for the direct system. 

Let us illustrate this by picking an arbitrary fraction for P. For P = (—) we 
get ±i = 1 + \ so P = [3, 1, 1, 1]. Note 7(-3) + 11(2) = 1, so let r = -3 and 

s = 2. Then we get: 

fe _ r±p(2k + t + l) _ 22fc±llt±ll-3 
~ sT9(2fc + t+l) ~~ 14fc ± 7i ± 7 + 2 

and 

-3 ± lit _ 3 Til* 
~~ -2±7i ~~ 2^7t 
are solutions to the system for the given P, for any constant t. 

Corollary 5.3. If P is an integral tangle (n) then R is either the infinity tangle, 
an integral tangle, or the sum of a vertical and integral tangle. 

Proof. Theorem 5.2 tells us that, since if r = 1 — n and s = 1 then r + ns = 1, R 
must be 

1 — n±nt\ , . ( 1 
= (n) + 



-lit y v ' v*ti, 

If t = or ±2 then i? is an integral tangle. If t = ±1 then i? is the infinity tangle. 
Otherwise, it is the sum of a vertical and integral tangle. □ 
In [7] we showed there are three classes of solutions for P, O k and O c to both 
systems. The example illustrated above lies in Class 3, described below. 
Class 1. P = (oo) and O c and Of are integral. In this case Theorem 5.1 tells us 

that R must be integral: | = -j, so for s = 1 and any r, say r — Owe get R = (^f ) • 
In addition, simply knowing that O k is rational, then Theorem 5.1 gives us that 
O k must be integral: O k = ^ ± ( 2fc + 1+t ) ^ f or the inverted case. 

The simplest example is when R = (0). In that case O k + O c = O k = ±(2k + 1) 
for inverted, or O k = (±2k) for direct [24]. We can, however, choose R to be any 
integral tangle, which corresponds to varying t, and O k changes accordingly. 
Class 2. P and O c are integral, and O k (and hence O k ) is (oo) for at most value 
of k, integral for at most 2 values of k — for all k in both the directed and inverted 
systems — and strictly rational otherwise. When O k is strictly rational, it must be 
either vertical or the sum of a vertical and an integral tangle [7] , possibly of different 
signs. (Note that if the vertical and the horizontal twists have opposite sign the 
tangle is said to be in "non-canonical" form.) As shown in Theorem 5.3, R must be 
the infinity tangle, integral, or the sum of a vertical and integral tangle. However, 
R and O k for some k cannot both be (oo). 

The simplest example in this class is when P = (0), R = (oo) and O k = 
(±l/(2k + 1)) for inverted, and 0° = (oo) and O k = (±l/2k) (k > 0) for di- 
rect, as first considered in [11]. This is biologically equivalent to the first example, 
in that the tangle surgery consists in interchanging the two tangles with no cross- 
ings. In general two solutions are biologically equivalent if their three-dimensional 
arrangement is the same, but have different projections. See our previous paper [7] 
and [49] for further discussion of biological equivalence. 
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Another example, which is biologically non-equivalent, is when P = (±2). In 
this case R = (±1) and 0° is infinity for inverted repeats and O and O 1 are 
integral for direct repeats; all other O k, s are strictly rational. For direct repeats 
O = O 1 = (t3) or O 2 = (t|), and O 3 = (t|). For inverted repeats 

0° = (oo) or ( T §), O 1 = (T|), O 2 - and O 3 = ( T f ). 

If a solution has O fe = (oo), for some k, as in the second example, then in fact 
it must satisfy several conditions: 

Theorem 5.4. Given a solution from Class 2 then: 

(i) If3i s.t. Of = (oo), then R must be vertical or (±1). 

(ii) IfBi and j s.t. Of = (oo) and O f is integral, then P E {0, ±2}. 

(Hi) IfBi s.t. Of = (oo) and P ^ (0), then i = 0. If in addition Of is integral, 
then j = 1 . 

Proof. The first two items were shown in [7]. The third item can be proved 
by looking at O k and R, which can be re- written as R = (j—^j^-^J and O k = 

(-P + 8T t T (2fc+i) ) for invcrted > and ° k = (-P + , T 4(2fc) ) for dircct ' whcre P = 
(p) 7^ (0). As R is vertical, this means that either s =F t = or p(s =F t) = 2. 
If s =F £ = 0, then, if O l = (oo), that 2z + 1 = for inverted, or 2i = for 
direct. The first is impossible, so i = for dircct. If further, is integral, then 
±(2j + 1) = ±1 for inverted, or ±2j = ±1 for direct. The first case gives j = 0, 
the second is impossible. The second case is similar: Since in this case p = ±2, this 
means s =F t = ±1. So 2i = =pl for dircct, which is impossible, and 2i + 1 = =pl 
for inverted, so i = 0. If in addition is integral, then ±1 =p (2j + 1) = ±1 for 
inverted, which is impossible, and ±1 =p (2j) = ±1, so j = 1, or j = which cannot 
occur since i = 0, so j = 1. □ 
Class 3. P and are strictly rational and and O c is integral. In this case R is 
also strictly rational: given O k = (f ), with b ^ ±1, if R = (c) then N(O k + R) = 
N (2±^) = AT (^i), which implies & = ±1, a contradiction. 

An illustration of the last class of solutions was given above, following Theo- 
rem 5.2. This class includes many different (biologically non-equivalent) possible 
actions. 

Class 4 (inverted case only). In the preceding section wc demonstrated that 
there is an additional solution for inverted sites: R, O c and P — (p) are integral, 
and 2 f is Montesinos with two non-integral tangles, and in fact is the minimal 
prime tangle [4]. In particular, P = ±(p), R = ±(1 + p) and O c are integral, O 2 is 

the prime Montesinos tangle T(^, f ,P~ 1) and O^ 2 = ±(— p+ ^) or =f(p+ 2 k+2 ) ' 
where p £ {0, 1}. This is the only case in which one of the constituent tangles is 
not rational, and in this case it is the smallest prime tangle (plus possibly a single 
horizontal twist). 

6. Conclusion 

We have completely classified all possible tangle solutions corresponding to a 
generic member of the integrase family of recombinases acting on DNA with either 
direct or inverted sites. We proved there are exactly three classes of solutions in 
the direct system, and exactly four in the inverted. 

The biological assumption is that the protein mechanism is consistent for a given 
protein across a variety of substrates. The integrase family of recombinases is 
forgiving of substrates — these members will perform recombination between two 
sites on the same (circular, as discussed here, or linear) or different (circular and/or 
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6(1,1) 6(5,1) 6(7,1) 



Figure 1. The fourth case for inverted repeats: 

P = (0), R = (-1), 0° = (oo), O 1 = K), O 2 = (I, -§), O 3 = (-i) 

linear) molecules. However, if an integrase requires a particular alignment of the 
two sites when they are brought together, then a substrate with inverted sites may 
have a different number of crossings introduced into O than a substrate with direct 
sites. (For example, it was shown that Flp aligns the sites in antiparallel, and so 
in the simplest case, O c = (1) for inverted and (0) for direct [24].) Biologically, 
the unequal number of solutions between the inverted and direct cases means that 
the fourth case is not likely, as this would mean that O k differs not only in the 
number of crossings but in the actual type of tangle (prime versus rational) . This 
would in turn imply a different mechanism for bringing the sites together depending 
whether they are in direct or inverted repeats (only for one product, corresponding 
to k = 2). A small protein being able to "sense" the global orientation (inverted 
versus direct) of the two sites would be unusual. However, there are examples of 
proteins which act in response to global characteristics, notably Topoisomerase II, 
which preferentially changes crossings to unknot DNA [41]. 

6.1. Future Directions. We conclude with several observations, and avenues for 
future biological and mathematical research. 

Electron microscopy images by Crisona and colleagues determine that for Flp 
in the cases k = 1 and k = 2, the products are in fact torus knots [11]. For the 
related recombinase, A integrase acting on LR or PB sites, electron microscopy 
has shown that the products are almost exclusively positive torus knots (inverted) 
and negative torus links (direct) [11], [45]. Biologically, one would expect a fixed, 
precise mechanism for this family of recombinases, which would predict that for 
Flp and other integrases, every product should be a torus knot or link. However, 
although one can experimentally demonstrate that higher-crossing products exist 
and determine their respective crossing number, their precise knot /link type has 
not been experimentally verified for any other integrase family member. As the 
crossing number increases, resolving the precise knot type becomes more crucial 
(and difficult) since the number of knots with a given crossing number increases 
dramatically — for instance, there are 1,701,936 knots with < 16 crossings [27]. 
Question 1: For Flp and other integrases (not A Int), can we obtain experimen- 
tal confirmation that every product is of the form b(2k, 1) (direct) or b(2k + 1, 1) 
(inverted)! 

If they are, one might be able develop similar Dehn surgery arguments such 
as the result from [31] used here to restrict O k . For example, for k < 10, we 
can eliminate the possibility of O k having as double branch cover a satellite knot 
complement by work of Bleiler and Lithcrland [5], and, in many cases, a torus knot 
complement by Moser and Gordon [23,37]. So in many cases, the first step would 
be to consider O k such that O k is a hyperbolic knot complement. 
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Second, we consider the chirality of the resulting products. Crisona et al. have 
characterized the first two nontrivial products of Flp-mediated inversion as almost 
exclusively positive torus knots [11]. 

Question 2: Can we determine the chirality of any of the Flp deletion products, 
or the higher crossing inversion products? 

If so, the classes of solutions could be tightened signifcantly by removing the 
possibility of mirror images, as in Corollary 4.9. 

Third, although the third class of solutions is mathematically possible, biological 
considerations such as DNA's stiffness impeding a high number of crossings, make 
them biologically unlikely. For this reason, biologists often assume in the tangle 
model that P and R each have or 1 crossings. 

Other work, notably [11] and [44], have incorporated a number of biologically 
reasonable assumptions into the tangle model which reduce the number of putative 
tangle solutions. For example the Generalized Random Collision assumes that the 
original DNA is exclusively negatively supercoiled {i.e., Oj — (n) or (— ^) for 
n > 0), which then in turn restricts O c and P [11], [44]. These assumptions are 
biologically reasonable, and preclude tangle solutions such as those of Class 3. But 
topologically, the more exotic cases cannot be excluded. 

Question 3: Can the third class of solutions be eliminated or restricted experimen- 
tally, or computationally using models (e.g., [35]) of DNA flexibility? 

If the number of crossings of the constituent tangles can be bounded with any 
degree of certainty, this reduces the solutions from an infinite number to a mere 
handful. For example, we consider the serine recombinase Xer acting on circular 
DNA with direct psi sites, whose corresponding tangle equations are N(0 + P) = 
6(1, 1) and N(0 + R) = N (f) = 6(4, 1) [9]. Assuming all tangles are rational and 
P= (0),and so = (±), then Darcy showed that /,' •• j-.^.^-. 3q 4^L_ ] } [13]. 
Additionally, Vazquez et al, by carefully analyzing the biological data, and assuming 
any nontrivial topology is in O c , show P is trivial (and hence in their setting can 
be chosen to be (0)), and show O is rational using techniques similar to [7]. Further 
they assume R is integral or 00, and show there exist 3 solutions: O = (^p ), R — 
(— 1) or O = (^r-),R = (+1) or O = (^-),R = (00), which can be biologically 
equivalent [49]. 

Additionally given torus link substrates constructed by A Int, Xer- mediated re- 
combination results in more complex knots: N(C>2 + P) = N (|), and N(02 + R) = 
seven-crossing product (knot), and N(0 3 + P) = N (y ) N(0 3 + R) = nine-crossing 
product [2] . Darcy also uses this information to analyze how differing seven-crossing 
products reduce the number of putative solutions [13]. 

Question 4 Can the precise type of knot resulting in the action of Xer on N (|) 
be determined? Can similar techniques to those used here be used to examine the 
Xer system in their full generality, without simplifying assumptions ? 

Finally, we reiterate that, although we have motivated our work by considering 
Flp, our results can be applied to any protein in the integrase family, such as Cre 
or A Int (acting on LR sites), whose products are the above torus knots or links. 
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